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SCOF\E Seaich Resuilts Details for Apphcaiion 
I 1 0563896 and Search Result 

I 2008 041 6„J 451 20^, us- 1 0-5 63 -896- 3. rag. 

Score Home Reirieve Application SCORE System SCORE Comments/ 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 10563896 and Search Result 
2008041 6_1 451 20_US-1 0-563-896-3. rag. 

Go.Mck.M.p.r.eyigus.p.age. 



GenCore version 6.2.1 
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: April 16, 2008, 14:55:47 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Search time 120 Seconds 
(without alignments) 
1705.326 Million cell updates/sec 

US-10-563-896-3 
1807 

1 MGKSKEISQDLRKKIVDLHK EGYPKRLTQVKQFKGNATKY 340 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 3405708 seqs, 601879884 residues 

Total number of hits satisfying chosen parameters: 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Min: 



Match 0% 
Match 100% 
Listing first 45 summaries 

A_Geneseq_200711 : * 
geneseqpl980s : * 
geneseqpl990s : * 
geneseqp2000 : * 
geneseqp2001 : * 
geneseqp2002 : * 
geneseqp2003a: * 
geneseqp2003b: * 
geneseqp2004a : * 
geneseqp2004b: * 
10: geneseqp2005 : * 
11: geneseqp2006 : * 
12: geneseqp2007: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed. 
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and is derived by analysis of the total score distribution. 

SUMMARIES 

Result Query 



0 . 


core 


Match 


eng 




ID 


Description 




1807 


100 


0 


340 




AAW6 8520 


Aaw68520 Salmonid— 






100 


0 






AAY 174 40 


Aayl7440 SB transp 




807 


100 


0 






AAY 72024 


Aay72024 SB (Sleep 






100 


0 






AAG80595 






1807 


100 


0 






ABBO 75 70 


Abb07570 SB transp 




1807 


100 


0 


340 




ABB99 7 1 9 






1807 


100 


0 






ADD4 1 924 


Add41924 Amino aci 




1807 


100 


0 


340 


10 


ADW2 26 78 


Adw226 78 Sleeping 


9 


1807 


100 


0 


340 


10 


ADW03280 


Adw03280 Sleeping 


10 


1807 


100 


0 


340 




AEF8 1 669 






1807 


100 


0 


340 


12 


AEP4 7151 


Aep47151 SB"'"transi 


12 


1802 


99 


7 


340 




ADKl 7 4 80 


Adkl7480 Salmonid 




1787 


98 


9 


340 




AEF8 1683 


Aef81683 Salmonid 


4 




98 


8 


340 




AAB 47211 




15 


1786 


98 


8 




10 


ADW46 561 


^Ad^46561^S^l^^"'^d 


16 


1786 


98 


8 


340 


12 


AEL92393 


Ael92393 Sleeping 


17 


1784 


98 


7 






AEF8 1 684 


Aef81684 Salinonid 


18 


1783 


98 


7 


340 


10 


ADW46 599 




19 


1783 


98 


7 


340 


10 


ADW4 6 6 49 


Adw46649 Salmonid 


20 


1783 


98 


7 


340 


10 


ADW46 612 


Adw46612 Salmonid 


21 


1783 


98 


7 


340 


10 


ADW46 621 


Adw46621 Salmonid 




1783 


98 


7 




10 


ADW4 6613 




23 


1782 


98 


6 


340 


10 


ADW46 626 


Ad^45626 2^^"^°"^^ 


24 


1782 


98 


6 


340 


10 


ADW46 644 


Ad"46644 S^l'"°"'"d 




1782 


98 


6 




10 


ADW46 614 


Ad"46614 S^l'"°"^d 




1782 


98 


6 


340 




ADW46 597 


Ad^46597 S^l"'°"^d 


27 


1782 


98 


6 






ADW46 587 


Ad^46587 S^l"'°"^d 


28 


1781 


98 


6 


340 


10 


ADW46656 


Adw46656 Salmonid 


29 


1781 


98 


6 


340 


10 


ADW46591 


Adw46591 Salmonid 






98 


6 






ADW46 643 


aH^4fifi^ft Salmonid 






98 


6 






ADW46 658 




32 


1781 


98 


6 


340 


10 


ADW4 6 588 






1781 


98 


6 


340 


10 


ADW46 605 




34 


1781 


98 


6 


340 


10 


ADW4 6 6 03 


Adw46603 Salmonid 


35 


1781 


98 


6 


340 


10 


ADW46653 


Adw46653 Salmonid 


36 


1781 


98 


6 


340 


10 


ADW46607 


Adw46607 Salmonid 


37 


1781 


98 


6 


340 


10 


ADW46615 


Adw46615 Salmonid 


38 


1781 


98 


6 


340 


10 


ADW46617 


Adw46617 Salmonid 


39 


1781 


98 


6 


340 


10 


ADW46590 


Adw46590 Salmonid 


40 


1781 


98 


6 


340 


10 


ADW46593 


Adw46593 Salmonid 


41 


1781 


98 


6 


340 


10 


ADW46608 


Adw46608 Salmonid 


42 


1781 


98 


6 


340 


10 


ADW46645 


Adw46645 Salmonid 


43 


1781 


98 


6 


340 


10 


ADW46601 


Adw46601 Salmonid 


44 


1781 


98 


6 


340 


10 


ADW46610 


Adw46610 Salmonid 


45 


1781 


98 


6 


340 


10 


ADW46630 


Adw46630 Salmonid 



ALIGNMENTS 



RESULT 1 
AAW68520 

ID AAW68520 standard; protein; 340 AA. 
XX 
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AAW68520; 




XX 






DT 


27-AUG-2003 


(revised) 


DT 


25-JAN-1999 


(first entry) 








DE 


Salmonid-type Tcl-like transposase protein. 


XX 






KW 


Transposase; 


salmonid; Tcl-like family; homology; vertebrate; transposon; 


KW 


zebrafish; gene transfer; gene therapy; inverted repeat; direct repeat; 


KW 


genome . 




XX 






OS 


transposons . 




XX 






FH 


Key 


Location /Qualifiers 


FT 


Domain 


11 . .61 


FT 




/note= "Paired-like domain with Leucine Zipper" 


FT 


Domain 


104 . .105 






/note= "form part of a bipart nuclear localisation 


FT 




signal" 


FT 


Domain 


116. .120 


FT 




/note= "form part of a bipart nuclear localisation 


FT 




signal" 


FT 


Domain 


151. .161 


FT 




/note= "DD(34)E box" 


FT 


Region 


183. .198 


FT 




/note= "Glycine-rich box" 


FT 


Domain 


241. .251 


FT 




/note= "DD(34)E box" 


FT 


Domain 


268. .282 


FT 




/note= "DD(34)E box" 


XX 






PN 


WO9840510-A1 




XX 






PD 


17-SEP-1998. 




XX 






PF 


ll-MAR-1998; 


98WO-US004687. 


XX 






PR 


ll-MAR-1997; 


97US-0040664P. 


PR 


28-JUL-1997; 


97US-0053868P. 


PR 


13-NOV-1997; 


97US-0065303P. 


XX 






PA 


(MINU ) UNIV 


MINNESOTA. 


XX 






PI 


Hackett PB, 


Ivies Z, Izsvak Z, Caldovic L; 


XX 






DR 


WPI; 1998-531525/45. 




N-PSDB; AAV33841. 


XX 






PT 


New isolated 


transposase gene - used to develop products for the 


PT 


integration of nucleic acid fragments into the nucleic acid of cells. 


PT 


particularly 


for gene therapy. 


XX 






PS 


Disclosure; Fig 2B; 79pp; English. 


XX 






cc 


This sequence represents a novel transposase protein of the salmonid-type 


cc 


Tcl-like family. The coding sequence was reconstructed based on sequence 


cc 


homology between members of the Tcl-like family of vertebrate 


cc 


transposons. 


especially based on the sequence of a number of fish, e.g. 


cc 


zebrafish transposons. The transposase can be used in a gene transfer 


cc 


method, especially for gene therapy treatment, by flanking a heterologous 


cc 


gene to be expressed in a cell with the inverted or direct repeat 
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'C sequences to which the transposase binds. This allows the transposase- 

'C mediated transposition of the heterologous into the genome of the host 

:C cell. (Updated on 27-AUG-2003 to correct OS field.) 

:x 

Q Sequence 340 AA; 

Query Match 100.0%; Score 1807; DB 2; Length 340; 

Best Local Similarity 100.0%; Pred. No. 2.4e-156; 

Matches 340; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

ly 1 MGKSKEISQDLRKKIVDLHKSGSSLGAISKRLKVPRSSVQTIVRKYKHHGTTQPSYRSGR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

lb 1 MGKSKEISQDLRKKIVDLHKSGSSLGAISKRLKVPRSSVQTIVRKYKHHGTTQPSYRSGR 60 

!y 61 RRVLSPRDERTLVRKVQINPRTTAKDLVKMLEETGTKVSISTVKRVLYRHNLKGRSARKK 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
lb 61 RRVLSPRDERTLVRKVQINPRTTAKDLVKMLEETGTKVSISTVKRVLYRHNLKGRSARKK 120 

ly 121 PLLQNRHKKARLRFATAHGDKDRTFWRNVLWSDETKIELFGHNDHRYVWRKKGEACKPKN 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
lb 121 PLLQNRHKKARLRFATAHGDKDRTFWRNVLWSDETKIELFGHNDHRYVWRKKGEACKPKN 180 

ly 181 TIPTVKHGGGSIMLWGCFAAGGTGALHKIDGIMRKENYVDILKQHLKTSVRKLKLGRKWV 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
lb 181 TIPTVKHGGGSIMLWGCFAAGGTGALHKIDGIMRKENYVDILKQHLKTSVRKLKLGRKWV 240 

ly 241 FQMDNDPKHTSKWAKWLKDNKVKVLEWPSQSPDLNPIENLWAELKKRVRARRPTNLTQL 300 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
lb 241 FQMDNDPKHTSKWAKWLKDNKVKVLEWPSQSPDLNPIENLWAELKKRVRARRPTNLTQL 300 

ly 301 HQLCQEEWAKIHPTYCGKLVEGYPKRLTQVKQFKGNATKY 340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
lb 301 HQLCQEEWAKIHPTYCGKLVEGYPKRLTQVKQFKGNATKY 340 



)://es/ScoreAccessWeb/GetItem.action?AppId=10563896&seqId=09323b67806f62ad&... 



4/25/2008 



